منابع مشابه
Switchboard Language Model Improvement with Conversational Data from Gigaword
........................................................................................................... 2 INTRODUCTION.................................................................................................. 2 1 THE STATE OF THE ART OF TEXT CLASSIFICATION ....................... 3 2 BUILDING A UNIGRAM CLASSIFIER...................................................... 5 2.
متن کاملDisfluencies in Switchboard
Disfluencies (“um,” repeats, self-repairs) are prevalent in spontaneous speech, and are relevant to both human speech communication and speech processing by machine. Although disfluencies have commonly been viewed as ‘noisy’ events, results from a large descriptive study indicate that disfluencies show regularities in a number of dimensions [9]. This paper reports selected results on Switchboar...
متن کاملResegmentation of SWITCHBOARD
The SWITCHBOARD (SWB) corpus is one of the most important benchmarks for recognition tasks involving large vocabulary conversational speech (LVCSR). The high error rates on SWB are largely attributable to an acoustic model mismatch, the high frequency of poorly articulated monosyllabic words, and large variations in pronunciations. It is imperative to improve the quality of segmentations and tr...
متن کاملInsights into Spoken Language Gleaned from Phonetic Transcription of the Switchboard Corpus
Models of speech recognition (by both human and machine) have traditionally assumed the phoneme to serve as the fundamental unit of phonetic and phonological analysis. However, phoneme-centric models have failed to provide a convincing theoretical account of the process by which the brain extracts meaning from the speech signal and have fared poorly in automatic recognition of natural, informal...
متن کاملEstimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language
Developing resources which can be used for Natural Language Processing is an extremely difficult task for any language, but is even more so for less privileged (or less computerized) languages. One way to overcome this difficulty is to adapt the resources of a linguistically close resource rich language. In this paper we discuss how the cost of such adaption can be estimated using subjective an...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computational Linguistics
سال: 2018
ISSN: 0891-2017,1530-9312
DOI: 10.1162/coli_a_00329